NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Fair Learning with Private Demographic Data

Mozannar, Hussein; Ohannessian, Mesrob; Srebro, Nathan (July 2020, Proceedings of the 37th International Conference on Machine Learning, PMLR 2020)
Daumé, Hal; Singh, Aarti (Ed.)
Sensitive attributes such as race are rarely available to learners in real world settings as their collection is often restricted by laws and regulations. We give a scheme that allows individuals to release their sensitive information privately while still allowing any downstream entity to learn non-discriminatory predictors. We show how to adapt non-discriminatory learners to work with privatized protected attributes giving theoretical guarantees on performance. Finally, we highlight how the methodology could apply to learning fair predictors in settings where protected attributes are only available for a subset of the data.
more » « less
Full Text Available
Distributionally Robust Policy Evaluation and Learning in Offline Contextual Bandits

Si, Nian; Zhang, Fan; Zhou, Zhengyuan; Blanchet, Jose. (August 2020, Proceedings of Machine Learning Research)
III, Hal Daumé; Singh, Aarti (Ed.)
Policy learning using historical observational data is an important problem that has found widespread applications. However, existing literature rests on the crucial assumption that the future environment where the learned policy will be deployed is the same as the past environment that has generated the data{–}an assumption that is often false or too coarse an approximation. In this paper, we lift this assumption and aim to learn a distributionally robust policy with bandit observational data. We propose a novel learning algorithm that is able to learn a robust policy to adversarial perturbations and unknown covariate shifts. We first present a policy evaluation procedure in an ambiguous environment and also give a heuristic algorithm to solve the distributionally robust policy learning problems efficiently. Additionally, we provide extensive simulations to demonstrate the robustness of our policy.
more » « less
Full Text Available
Adversarial risk via optimal transport and optimal couplings

Pydi, Muni Sreenivas; Jog, Varun (July 2020, Proceedings of Machine Learning Research)
Daumé, Hal III; Singh, Aarti (Ed.)
Full Text Available
Learning Selection Strategies in Buchberger’s Algorithm

Peifer, Dylan; Stillman, Michael; Halpern-Leistner, Daniel (July 2020, Proceedings of the 37th International Conference on Machine Learning)
III, Hal Daumé; Singh, Aarti (Ed.)
Studying the set of exact solutions of a system of polynomial equations largely depends on a single iterative algorithm, known as Buchberger’s algorithm. Optimized versions of this algorithm are crucial for many computer algebra systems (e.g., Mathematica, Maple, Sage). We introduce a new approach to Buchberger’s algorithm that uses reinforcement learning agents to perform S-pair selection, a key step in the algorithm. We then study how the difficulty of the problem depends on the choices of domain and distribution of polynomials, about which little is known. Finally, we train a policy model using proximal policy optimization (PPO) to learn S-pair selection strategies for random systems of binomial equations. In certain domains, the trained model outperforms state-of-the-art selection heuristics in total number of polynomial additions performed, which provides a proof-of-concept that recent developments in machine learning have the potential to improve performance of algorithms in symbolic computation.
more » « less
Full Text Available
Peer Loss Functions: Learning from Noisy Labels without Knowing Noise Rates

Liu, Yang; Guo, Hongyi (January 2020, Proceedings of the 37th International Conference on Machine Learning)
Daumé III, Hal; Singh, Aarti (Ed.)
Learning with noisy labels is a common challenge in supervised learning. Existing approaches often require practitioners to specify noise rates, i.e., a set of parameters controlling the severity of label noises in the problem, and the specifications are either assumed to be given or estimated using additional steps. In this work, we introduce a new family of loss functions that we name as peer loss functions, which enables learning from noisy labels and does not require a priori specification of the noise rates. Peer loss functions work within the standard empirical risk minimization (ERM) framework. We show that, under mild conditions, performing ERM with peer loss functions on the noisy data leads to the optimal or a near-optimal classifier as if performing ERM over the clean training data, which we do not have access to. We pair our results with an extensive set of experiments. Peer loss provides a way to simplify model development when facing potentially noisy training labels, and can be promoted as a robust candidate loss function in such situations.
more » « less
Full Text Available
Deep Molecular Programming: A Natural Implementation of Binary-Weight ReLU Neural Networks

Vasic, Marko; Chalk, Cameron; Khurshid, Sarfraz; Soloveichik, David (January 2020, Proceedings of the 37th International Conference on Machine Learning)
III, Hal Daumé; Singh, Aarti (Ed.)
Embedding computation in molecular contexts incompatible with traditional electronics is expected to have wide ranging impact in synthetic biology, medicine, nanofabrication and other fields. A key remaining challenge lies in developing programming paradigms for molecular computation that are well-aligned with the underlying chemical hardware and do not attempt to shoehorn ill-fitting electronics paradigms. We discover a surprisingly tight connection between a popular class of neural networks (binary-weight ReLU aka BinaryConnect) and a class of coupled chemical reactions that are absolutely robust to reaction rates. The robustness of rate-independent chemical computation makes it a promising target for bioengineering implementation. We show how a BinaryConnect neural network trained in silico using well-founded deep learning optimization techniques, can be compiled to an equivalent chemical reaction network, providing a novel molecular programming paradigm. We illustrate such translation on the paradigmatic IRIS and MNIST datasets. Toward intended applications of chemical computation, we further use our method to generate a chemical reaction network that can discriminate between different virus types based on gene expression levels. Our work sets the stage for rich knowledge transfer between neural network and molecular programming communities.
more » « less
Full Text Available
Stochastic Gradient and Langevin Processes

Cheng, Xiang; Yin, Dong; Bartlett, Peter L.; Jordan, Michael (January 2020, Proceedings of the 37th International Conference on Machine Learning)
Daumé III, Hal; Singh, Aarti (Ed.)
We prove quantitative convergence rates at which discrete Langevin-like processes converge to the invariant distribution of a related stochastic differential equation. We study the setup where the additive noise can be non-Gaussian and state-dependent and the potential function can be non-convex. We show that the key properties of these processes depend on the potential function and the second moment of the additive noise. We apply our theoretical findings to studying the convergence of Stochastic Gradient Descent (SGD) for non-convex problems and corroborate them with experiments using SGD to train deep neural networks on the CIFAR-10 dataset.
more » « less
Full Text Available
On Approximate Thompson Sampling with Langevin Algorithms

Mazumdar, Eric; Pacchiano, Aldo; Ma, Yian; Jordan, Michael; Bartlett, Peter (January 2020, Proceedings of the 37th International Conference on Machine Learning)
Daumé III, Hal; Singh, Aarti (Ed.)
Thompson sampling for multi-armed bandit problems is known to enjoy favorable performance in both theory and practice. However, its wider deployment is restricted due to a significant computational limitation: the need for samples from posterior distributions at every iteration. In practice, this limitation is alleviated by making use of approximate sampling methods, yet provably incorporating approximate samples into Thompson Sampling algorithms remains an open problem. In this work we address this by proposing two efficient Langevin MCMC algorithms tailored to Thompson sampling. The resulting approximate Thompson Sampling algorithms are efficiently implementable and provably achieve optimal instance-dependent regret for the Multi-Armed Bandit (MAB) problem. To prove these results we derive novel posterior concentration bounds and MCMC convergence rates for log-concave distributions which may be of independent interest.
more » « less
Full Text Available

Search for: All records